KBNLresearch's Repositories

100 repositories

.github
No description
⭐ 0 🌐 Public
alto-editor
Browser based post correction tool for Alto XML files
⭐ 14 🌐 Public
Annif
Annif is a multi-algorithm automated classification and subject indexing tool for libraries, archives and museums. This repository is used for developing a production version of the system, based on ideas from the initial prototype.
⭐ 0 🌐 Public
Annif-documentation
Reports about the experiments using Annif
⭐ 0 🌐 Public
Annif_data_exp
Automatic subject assignment for KB ebooks using Annif.
⭐ 1 🌐 Public
awesome-sentiment-analysis
😀😄😂😭 A curated list of Sentiment Analysis methods, implementations and misc. 😥😟😱😤
⭐ 1 🌐 Public
bb_recog
Book back recognition
⭐ 1 🌐 Public
BERT-NER
Pytorch-Named-Entity-Recognition-with-BERT
⭐ 0 🌐 Public
Brinkeys
Automatic metadating: assigning Brinkman keywords
⭐ 0 🌐 Public
Brinkman-catalogus
The data and code accomanying my research master thesis: Exploring text mining techniques tostructure a digitised catalogue.
⭐ 0 🌐 Public
cdtestcorpus
Scripts and data for creating test CDs using different CD layouts
⭐ 2 🌐 Public
chatbot-builder-nl
No description
⭐ 3 🌐 Public
children_book_data
The data and the annotations from crowdsourcing are stored in this repository.
⭐ 0 🌐 Public
CHRONIC
Classified Historical Newspaper Images
⭐ 2 🌐 Public
dac
Entity linker for the newspaper collection of the National Library of the Netherlands. Links named entity mentions to DBpedia descriptions using either a binary SVM classifier or a neural net.
⭐ 11 🌐 Public
dac-web
Web interface to manually annotate named entity mentions in newspaper articles with the correct DBpedia link(s), if any. Produces labeled data sets for training and evaluating the DAC Entity Linker.
⭐ 8 🌐 Public
dbnl
Scripts to work with the Public Domain files of DBNL: https://www.dbnl.org/letterkunde/pd/index.php
⭐ 0 🌐 Public
DBNL-canonicity
KB RiR project to Collect a corpus of Dutch novels 1800-2000 and Investigate Canonicity
⭐ 3 🌐 Public
dbnl-scripts
Scripts to scrape DBNL and work with the texts.
⭐ 0 🌐 Public
dbnl_to_dracor
XML transformation from DBNL to DraCor format
⭐ 1 🌐 Public
dbpedia-indexer
Collection of Python scripts to build a Solr index from selected Dutch and English DBpedia dumps.
⭐ 1 🌐 Public
delpher_demo
This repository contains Jupyter Notebooks, code and a test data set to replicate the analyses of the website http://delpher_demo.kbresearch.nl
⭐ 0 🌐 Public
Demosaurus
Demo web application that supports author attribution (thesaureren) and topic attribution (subject indexing). Annif is used for the latter.
⭐ 2 🌐 Public
detectDamagedAudio
Tests on how to detect damaged WAV files
⭐ 2 🌐 Public
detectStorageMediaType
Storage media type detection using Python and the Windows API
⭐ 0 🌐 Public
DHBenelux2018
DHBenelux 2018
⭐ 0 🌐 Public
dictionary-viewer
View the number of newspaper articles per year containing a user-specified minimum number of keywords.
⭐ 2 🌐 Public
digger
DIGGER dataset code
⭐ 0 🌐 Public
DiggingThroughGarbage
No description
⭐ 0 🌐 Public
diskimgr
Simple workflow tool for imaging block devices
⭐ 17 🌐 Public
dutchdracor
Dutch Drama Corpus
⭐ 0 🌐 Public
Ebook-Fixer
No description
⭐ 4 🌐 Public
ebooks-qa
Scripts for quality assessment of e-books
⭐ 3 🌐 Public
enhance_ocr
Enhance OCR of newspapers archive
⭐ 3 🌐 Public
EntangledHistories
Processing of Transkribus output using xslt and running it through Annif
⭐ 1 🌐 Public
epub2to3
Epub 2 to Epub 3 conversion workflow
⭐ 0 🌐 Public
epubPolicyTests
No description
⭐ 2 🌐 Public
epubPolicyValidate
No description
⭐ 0 🌐 Public
erfgoedbot
Facebook Messenger bot with Dutch erfgoed data, linked to Wikidata
⭐ 0 🌐 Public
Europeana-Full-Text-in-Python
Various Python scripts to assist with searching and downloading full text records via the Europeana APIs.
⭐ 1 🌐 Public
europeananp-dbpedia-disambiguation
No description
⭐ 16 🌐 Public
europeananp-ner
Named Entities Recognition Annotator Tool for Europeana Newspapers
⭐ 61 🌐 Public
forensicImagingResources
No description
⭐ 16 🌐 Public
frame-generator
Tool for extracting topics, keywords and their collocates from a Dutch corpus. Includes and extends the functionality of the Keyword Generator.
⭐ 8 🌐 Public
frame-generator-gui
Web interface for the Frame Generator.
⭐ 2 🌐 Public
gado2
Dutch/Indonesian BERT-NER setup.
⭐ 1 🌐 Public
gdmodule
Python GD module, originally by Richard Jones
⭐ 0 🌐 Public
genre-classifier
Genre classifier for Dutch historical newspaper articles.
⭐ 7 🌐 Public
genre-classifier-gui
Web interface for the genre classifier.
⭐ 2 🌐 Public
geolocatedomains
Geolocate list of web domains
⭐ 0 🌐 Public
hack4europe
Javascript based portal for searching Europeana collections and creating enrichments on the metadata.
⭐ 1 🌐 Public
Hackalod
This is the github repo of the Koninklijke Bibliotheek (KB) created for the Hackalod 2021 (https://hackalod.com/)
⭐ 1 🌐 Public
heritrix3-crawler-status-reporting-fix
Heritrix is the Internet Archive's open-source, extensible, web-scale, archival-quality web crawler project.
⭐ 0 🌐 Public
image2ascii
Generate ASCII art from images
⭐ 0 🌐 Public
imgquad
IMaGe QUality Assessment for Digitisation batches
⭐ 0 🌐 Public
intro-kb-apis
Materials for the RUG workshop on the KB search and harvest APIs.
⭐ 1 🌐 Public
ipmlab
Image Portable Media Like A Boss
⭐ 3 🌐 Public
iromlab
Loader software for automated imaging of optical media with Nimbie disc robot
⭐ 36 🌐 Public
iromlab-socketclient
Socket client demo for Iromlab
⭐ 0 🌐 Public
iromlabDemobatch
Iromlab demo batch - actual image/audio files replaced with empty files
⭐ 0 🌐 Public
iromsgl
Single-disc version of Iromlab
⭐ 3 🌐 Public
isbnlib
python library to validate, clean, transform and get metadata of ISBN strings (for devs).
⭐ 0 🌐 Public
isbnlib-kb
A metadata plugin for isbnlib using the service of the KB (National Library of the Netherlands).
⭐ 1 🌐 Public
isolyzer
Verify size of ISO 9660 image against Volume Descriptor fields
⭐ 53 🌐 Public
IwI22_ARTIST
This repository contains the Jupyter Notebooks and other information as created during ICT With Industry 2022
⭐ 1 🌐 Public
jhove-rest
REST Wrappings for JHOVE
⭐ 0 🌐 Public
jp2kMagic
Magic signatures for JPEG 2000 format family + sample files
⭐ 0 🌐 Public
jp2StructCheck
JP2 file structure checker (don't use this, use jpylyzer instead)
⭐ 0 🌐 Public
jp2totiff
No description
⭐ 7 🌐 Public
jp2view
experimental java jp2 viewer using jni bindings with openjpeg2.0
⭐ 3 🌐 Public
jpeg-quality-demo
Test scripts and resources for jpeg quality assessment
⭐ 4 🌐 Public
jprofile
Automated JP2 profiling for digitisation batches
⭐ 3 🌐 Public
jpylyzer
JP2 (JPEG 2000 Part 1) validator and properties extractor. Jpylyzer was specifically created to check that a JP2 file really conforms to the format's specifications. Additionally jpylyzer is able to extract the technical characteristics of each image.
⭐ 0 🌐 Public
kb-imageviewer-client
Interactive client for the imageviewer service of the KB
⭐ 0 🌐 Public
KB-python-API
Python API for KB data-services
⭐ 19 🌐 Public
KBNLresearch.github.io
Kb NL Research misc page
⭐ 0 🌐 Public
keyword-generator
Command-line tool to extract a ranked list of relevant keywords from a corpus with the option of using either topic modeling or tf-idf scores.
⭐ 40 🌐 Public
lean-reader
Based on the readium/ts-toolkit example for experimentation with readaloud voices and highlighting
⭐ 0 🌐 Public
magic-file-java-6
Experimental Java binding for libmagic file characterisation
⭐ 1 🌐 Public
mcc
mcc
⭐ 0 🌐 Public
mobile-apps
Resources, documentation on long-term preservation and access mobile apps
⭐ 0 🌐 Public
Multimodal-Presentations
Multimodal Presentations
⭐ 0 🌐 Public
Narralyzer
Narralyzer is a narrative analyzer
⭐ 8 🌐 Public
nl-menu-resources
Various resources and documentation related to the nl-menu recovery efforts
⭐ 0 🌐 Public
oai-pmh-bulk-downloader
Bulk downloader of web resources via OAI/PMH
⭐ 4 🌐 Public
ochre
Toolbox for OCR post-correction
⭐ 121 🌐 Public
ocropus-wrapper
Simple Python wrapper for ocropus command line invocation
⭐ 4 🌐 Public
omimgr
Simple workflow tool for imaging optical media
⭐ 10 🌐 Public
omSipCreator
Create ingest-ready SIPs from batches of optical media images
⭐ 7 🌐 Public
online-convert-example-files
Mirror of file format dataset by online-convert.com
⭐ 0 🌐 Public
openjpeg-decoder-service
A java based jp2 decoder service.
⭐ 5 🌐 Public
OpenRefine-Wikibase
Files for interaction between OpenRefine and KB Wikibases
⭐ 1 🌐 Public
pdf-characterisation
Scripts and raw results of PDF characterisation experiments
⭐ 0 🌐 Public
pdfPolicyVeraPDF
Demo of policy-based validation with VeraPDF
⭐ 0 🌐 Public
pdfquad
No description
⭐ 2 🌐 Public
ProtoCST
A prototype webapplication for corpus selection, inspection and export
⭐ 1 🌐 Public
Python_introduction_summerschool
Python Introduction for the KB summerschool
⭐ 0 🌐 Public
readium-speech
💬 A TypeScript library for implementing read aloud on the Web
⭐ 0 🌐 Public
readium-ts-toolkit
A toolkit for ebooks, audiobooks and comics written in Typescript
⭐ 0 🌐 Public
readium-web
🌐 Readium Web is a toolkit for building Web Readers that support ebooks, audiobooks and comics
⭐ 0 🌐 Public